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13.  ABSTRACT 


i'  This  report  gives  a  detailed  treatment  of  the  theory  and 
implementation  of  a  speech  waveform  coding  technique  known 
as  subband  coders  whose  operations  include  the  partitioning 
of  the  input  speech  band  into  subbands  and  the  quantization 
of  subbands  individually.  Both  the  integer-band  sampling 
and  the  quadratxire  mirror  filtering  (QMF)  approaches  are 
studied  and  the  result  Indicates  that  the  latter  method 
yields  much  Improved  speech  quality  as  compared  to  the 
former  one. 

In  addition  to  the  design  of  bandpass  filters  amd 
subband  quantizers,  practical  aspects  in  the  implementation 
of  subbfuid  coders  are  considered  emd  they  include 

cl)  a  more  efficient  digital  interpolation  scheme 
f2)  a  high  speed  multiplier-accumulator,  ^ 
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SECTION  I 
Sununary 

1.1  Sununary  of  the  Program 

Under  the  six-month  Subband  Coder  Study  Contract,  (DCA  100- 
79-C-OOOl) ,  GTE  Sylvania  developed  real-time  software  for  a  full- 
d  iplex  9.6  Kb/s  and  a  half-duplex  16,0  Kb/s  speech  transmission 
scheme  using  the  subband  encoding  technique.  This  software  was 
designed  to  operate  on  the  two  GTE  Sylvania  Programmable  Signal 
Processors  (PSP)  already  owned  by  the  Defense  Communications 
Agency. 

This  study  and  implementation  effort  has  resulted  in  a  number 
of  significant  accomplishments  in  developing  speech  processing 
algorithms  and  hardware.  Among  them,  the  most  important  ones  are: 

1)  the  development  of  real-time  software  for  the 
9.6  and  16.0  Kb/s  subband  coders 

2)  the  development  of  a  high  speed  programmable 
multiplier-accumulator  (MULACC) 

3)  the  development  of  a  novel  digital  interpolation 
scheme 

4)  the  development  of  an  improved  subband  coder 
that  employs  quadrature  mirror  filters  (QMF) 

The  Subband  Coder  Study  was  motivated  by  initial  Bell 
Laboratories  simulations  which  showed  that  the  algorithm  has 
potential  of  providing  good  performance  with  reasonable  complexity 
at  data  rates  between  9.6  and  16.0  Kb/s.  Preliminary  listening 
tests  indicated  that  the  subband  coder  yielded  better  speech 
quality  than  the  continuously  variable  slope  delta  modulation 
(CVSD)  of  the  scune  data  rate.  However,  to  evaluate  the  algorithm 
fully,  a  large  number  of  processed  sentences  is  necessary.  Since 
it  is  impractical  to  generate  such  a  data  base  using  FORTRAN 
floating  point  simulations,  real-time  implementation  of  the 
subband  coder  algorithm  on  the  PSP’s  provides  a  viable  solution. 

Though  the  subband  codier  is  conceptually  simple,  programming 
of  the  algorithm  on  the  PSP  using  fixed-point  arithemtic  is  non¬ 
trivial  based  on  the  limitations  of  the  machine's  computational 
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speed  and  16-bit  accuracy.  One  such  problem  arises  when  the 
exact  Bell  Lab's  band-pass  filters  have  to  be  utilized  in  order 
to  duplicate  their  simulations.  These  filters,  in  general,  are 
characterized  by  their  sharp  cutoffs  and  flat  passband  responses. 
Unfortunately,  each  of  them  is  200  taps  long  and  this  increases 
tremendously  the  processing  requiremt'nts.  Furthermore,  the 
fixed-point  16-bit  arithmetic  hardware  in  the  PSP  does  not 
provide  enough  accuracy  in  performing  the  bandpass  filtering 
operation  due  to  intermediate  product  truncations  and  roundoffs. 

To  circumvent  the  mentioned  difficulties,  GTE  Sylvania  designed 
a  high-speed  multiplier-accumulator  (MULACC)  which  performs  a 
16  X  16  bit  multiplication  with  a  35-bit  product  accumulation  in 
208  nsec  (2  PSP  machine  cycles) .  Occupying  two  PC  cards,  this 
circuitry  communicates  to  the  PSP  CPU  via  the  I/O  buses.  One 
apparent  advantage  of  the  hardware  is  that  the  PSP's  basic  design 
remains  unchanged  and  existing  software  is  still  operable  on  the 
modified  machines.  Moreover,  the  speed  and  accuracy  of  MULACC 
permits  the  real-time  implementations  of  9.6  and  16.0  Kb/s  subband 
coders  on  the  PSP's. 

To  further  simplify  the  complexity  of  the  algorithm,  GTE 
Sylvania  developed  a  new  digital  interpolation  scheme  which  is 
capable  of  reducing  the  number  of  multiplications  needed  by  50%. 
Even  though  the  method  is  not  utilized  in  the  real-time  implementa¬ 
tions,  it  can  be  directly  applied  to  other  signal  processing  areas 
that  involve  digital  interpolations. 

Upon  the  completion  of  the  real-time  programming,  extensive 
listening  tests  were  performed  on  the  processed  outputs.  The 
results  indicate  that  the  speech  quality  of  the  subband  coder  is 
slightly  "hollow"  and  noisy  especially  for  the  9.6  Kb/s  system, 
but  its  overall  quality  is  significantly  better  than  that  of 
CVSD.  However,  when  compared  to  other  high  quality  16  Kb/s 
speech  encoding  schemes,  such  as  the  Adaptive  Predictive  Coder 
with  Adaptive  Quantization  (APCQ)  developed  for  DCA  under  Contract 
DCA-100-76-C-0002 ,  the  subband  coder  is  clearly  inferior.  This 
can  be  partly  attributed  to  the  fact  that  the  four  or  five  band¬ 
pass  filters  employed  by  the  subband  coder  algorithm  are  not 
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adequate  to  represent  the  entire  speech  spectrum.  Tnis  intro¬ 
duces  spectral  "gaps"  which  manifest  the  "hollowness"  in  the 
processed  speech.  Moreover,  the  ratio  of  subband  energies 
fluctuates  largely  from  frame  to  frame  and 

quantization  of  them  using  adaptive  PCM  (APCM)  with  fixed  bit 
allocations  is  suboptimal. 

Armed  with  this  new  information,  GTE  Sylvania  developed  a 
FORTRAN  simulation  of  an  improved  subband  coding  scheme  using 
quadrature  mirror  filters  (QMF) .  Utilizing  a  3-stage  QMF 
structure,  the  input  speech  band  is  split  into  8  subbands  which 
can  be  recombined  perfectly  to  form  the  original  one.  No 
spectral  distortions  are  introduced  in  the  bandsplitting  and 
reconstruction  processes  even  with  a  relatively  short  filter. 
Furthermore,  to  enable  a  more  efficient  encoding  of  the  subband 
waveforms,  an  adaptive  bit  allocation  scheme  is  incorporated 
which  assigns  different  quantizer  bits  according  to  the  energies 
of  the  subbands.  As  compared  to  the  original  subband  coder, 

FORTRAN  simulations  of  the  QMF  approach  indicate  that  the  processed 
speech  is  much  improved  and  especially  at  16  Kb/s,  high  quality 
speech  is  resulted. 

Because  of  the  improved  voice  quality  provided  by  the  QMF 
subband  coder,  further  work  should  be  performed  to  refine  the 
algorithm  and  implement  it  in  real-time  on  the  modified  PSP's. 
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SECTION  II 


SOFTWARE  SIMULATIONS 


2.1  Introduction 

The  subband  coding  algorithms  investigated  under  this 
contract  are  generally  classified  as  waveform  coding  techniques 
which,  in  contrast  to  source  encoding  methods  that  model  only 
the  sources  of  speech  production,  utilize  speech  waveform  proper¬ 
ties,  such  as,  distributions  of  amplitude  and  power,  nonflat 
characteristics  of  speech  spectra.  Moreover,  though  quantization 
is  applied  to  actual  time  waveJ’orms,  subband  coding  is  cate¬ 

gorized  as  a  frequency  domain  technique  .  This  is  due  to  the 
fact  that  the  input  speech  band  is  partitioned  into  several  fre¬ 
quency  components  known  as  subbands  and  they  are  individually  en¬ 
coded.  By  varying  dynamically  the  number  of  bits  utilized  to  quan¬ 
tize  each  subband,  the  overall  available  bits  can  be  distributed 
such  that  the  Important  frequency  component  is  quantized  with  more 
accuracy.  In  the  case  of  speech  signals,  perceptually  Important 
acoustic  attributes,  such  as  pitch  and  first  formant  which  arc 
located  at  low  frequency  subbands,  are  preferentially  encoded  witli 
more  bits.  However,  for  upper  subbands  where  fricatives  or  unvoiced 
sounds  occur,  encoders  with  fewer  bits/sample  are  utilized  since 
these  subbands  contribute  less  perceptually.  An  obvious  advantage 
of  subband  coding  over  coding  in  full  band  is  that  different 
quantizers  are  employed.  As  a  result,  the  quantization  noise  is 
confined  to  each  individual  subband  and  this  prevents  the  masking 
of  signals  in  one  frequency  band  by  quantization  noise  of  another. 

2.2  Operating  Principles  of  Subband  Coder 

Operations  of  the  subband  coder  Include  the  partitioning  of 
the  input  speech  band  into  N  subbands  using  bandpass  filters, 

...  in  the  manner  shown  In  Figure  2-1.  Each  of  the  resulting 
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subbands,  after  frequency  translation  to  DC,  is  digitally  encoded 
for  transmission.  At  the  receiver,  these  transmitted  data  are  de¬ 
coded  and  later  modulated  upwards  to  their  corresponding  center 
frequencies.  After  bandpass  filtering,  the  resulting  subbands  are 
then  added  together  to  form  an  estimate  of  the  input  signal  . 


2.2.1  Integer-Band  Sampling 

To  perform  the  frequency  translations  of  the  subband  spectra, 
generally  a  complicated  modulation/demodulation  scheme  has  to  be 
applied.  Fortunately,  through  a  new  technique  known  as  integer-band 
sampling,  this  modulation  process  can  be  avoided [33 .  This  method 
calls  for  the  filtering  of  the  input  signal  into  subbands  of  band- 
widths  followed  by  the  resampling  of  the  outputs  at  their  cor¬ 
responding  Nyqulst  rates  2W^ .  At  the  receiver,  the  reverse  opera¬ 
tion  is  performed  which  reconstructs  the  input  waveform  through 
upsampling  the  decoded  subbands  followed  by  bandpass  filtering. 
Though  the  integerband  sampling  technique  alleviates  the  use  of  any 
complex  modulation  schemes,  the  choice  of  subbands  is  rather  re¬ 
stricted.  Moreover,  to  prevent  spectral  aliasing,  the  subbands 
have  to  lie  between  frequencies  kW^  and  (ktl)W^  where  k  is  an 
integer . 


For  explanatory  purposes,  the  bandpass  filtering  of  one  sub¬ 
band  is  detailed  in  Figure  2-2(a).  If  y(n)  denotes  the  bandpass 
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signal  sampled  originally  at  f^  =  and  Y(e‘'  ),  the  spectrum  of 


y(n),  is  depicted  in  Figure  2-2(b). 
given  by: 


Tlie  resampled  signal  y^(n)  is 


y^(n)  =y(Mn)  =  y 


2W 


n 


(2-1) 


1 

«>» 


where  M  = 


2W, 


is  defined  as  the  decimation  ratio.  Consequently, 


y^(n)  has  a  ^  lower  sampling  rate  f^ 

X  s 


=  1_ 

T^ 


Defining  a  new  sequence  w(n)  as; 
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M-1  .  2tTiln 

or  w(n)  =  2^  £  o'"  «  (2-3) 

£*0 

The  sequence  y^(n)  can  be  rewritten  as: 

y^(n)  =  w(Mn)  (2-4) 

To  compare  the  spectra  of  y^(n)  and  y(n),  the  Z-transform  of 
Eq.  (2-4)  Is  taken  which  yields  : 


Y^(Z)  =  ^  w(Mn)Z"" 
n=-oo 

n=-oo 


M-1 

^  y(n)e 

2Tr£n  \ 

=  -  r  1 

M  1 

M  J 

£=0 

^  n=-««> 

/ 

M-1  2tt£  \ 

=  ij:  Y  e'" 

£=0  '  ' 

The  Fourier  transform  is  obtained  by  evaluating  the  Z-transform 
shown  in  Eq.  (2-5)  on  the  unit  circle  and  this  results  in: 
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y^(eJ“T')  -  i  f;  Y  (  eJ  ) 


(2-6) 


From  Eq.  (2-6),  Y^(e'^’^'^  )  depicts  a  repeated  version  of  Y(e‘^'^'^  ) 
and  a  graphical  representation  of  it  is  shown  in  Figure  2-2(c). 

To  prevent  spectral  aliasing,  it  is  clear  that  the  subband  fre¬ 
quency  has  to  lie  between  kW^  and  (k+l)W^  where  k  is  an  integer 
(k=2  in  Figure  2-2).  This  Imposes  a  constraint  on  the  choice  of 
subbands  which  affects  the  performance  of  subband  coders  tremen¬ 
dously  . 

At  the  receiver,  the  decimated  signal  y^(n)  is  Interpolated 
upwards  M  times  in  order  to  achieve  the  original  sampling  fre¬ 
quency  of  f  =  i.  In  fact,  every  (M-1)  zeros  are  inserted  between 
consecutive  y(n)'s  and  the  resulting  sequence  v(n)  is  defined  as: 


v(n)  = 


y^(n/M)  ;  n*0,  +M,  +2M,  .. 


(2-7) 


otherwise 


To  see  the  relationships  between  the  spectra  of  v(n)  and  y^(n), 
take  the  Z-transform  of  v(n)  and  this  yields: 


V(Z)  =  y^(n/M)Z""  =  ^  y^(n)Z' 


(2-8) 


=  Y^(Z"*) 


The  Fourier  transform  of  Eq.  (2-8)  is: 

v(eJ'''').  Yi  (eJ“™).  ) 


(2-9) 


where  T^  is  the  sampling  period  of  the  decimated  waveform  y-(n). 

1  wT 

A  graphical  representation  of  7(0*^  )  is  shown  in  Figure  2-2(d). 


!■ 


From  Eq.  (2-9),  the  spectrum  of  v(n)  tias  the  Identical  shape  as 
that  of  y^(n)  after  normalization  with  respect  to  their  sampling 
rates.  By  bandpass  filtering  v(n),  a  replica  of  y  (n)  is  created 
as  shown  in  Figure  2-2(e). 

2.2.2  Characteristics  of  Band  Pass  Filters 

Utilizing  the  integer-band  sampling  technique,  the  divid¬ 
ing  of  input  speech  band  into  subbands  is  accomplished  with  a  set 
of  bandpass  filters  whose  passbands  are  chosen  to  span  the  entire 
speech  band  with  minimal  overlap.  Specifications  of  9.6  and 
16.0  Kb/s  subband  filters  and  quantizer  parameters  are  summarized 
in  Tables2-1  and  2-2.  For  the  9.6  Kb/s  system,  there  are  four 
subbands  which  cover  ranges  from  2^0-2880  Hz.  Each  of  the  filters 
is  200  taps  long  and  is  characterized  by  a  relatively  flat  pass- 
band  and  a  short  transition  region.  The  coefficients  of  the  band¬ 
pass  fl Iters, furnished  by  R.  Crochlere  of  Bell  Labs,  are  shown 
in  Appendix  A.  The  overall  magnitude  response  of  the  9.6  Kb/s 
system,  obtained  by  combining  the  responses  of  the  ^  filters,  is 
shown  in  Figure  2-3  and  it  reveals  large  spectral  gaps  at  fre¬ 
quencies  around  1000  Hz  and  1800  Hz  which  accounts  for  the  "hollow" 
and  reverberent  quality  in  the  processed  speech.  As  for  the 
16.0  Kb/s  system,  five  subbands  which  span  frequencies  from  178- 
3200  Hz  are  utilized.  These  filters  are  also  200  taps  long  and 
their  coefficients  are  tabulated  in  Appendix  B.  The  overall 
magnitude  response,  obtained  by  combining  the  responses  of  five 
filters ,  is  shown  in  Figure  2-^.  As  compared  to  tliat  of  the 
9.b  Kb/s  system.  Figure  2-^^  shows  a  narrowing  of  the  gaps  in  the 
reconstructed  spectrum  which  results  in  Improved  speech  quality. 
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2.2.3  Encoding  of  Subband  Signals 

It  is  well  known  that  the  long-term  spectrum  of  speech 
signals  tends  to  decrease  In  power  density  with  frequency, 
yet  the  drop  Is  relatively  small  for  waveforms  aci’oss  any  one 
subband.  In  other  words,  after  frequency  translation  of  the 
bandpassed  input  to  DC  and  resampling  it  at  the  Nyqulst  rate, 
the  signal  has  little  correlation  between  adjacent  samples 
and  encoding  of  it  was  shown  to  be  best  performed  using 
adaptive  PCM  (APCM)I^.  The  most  common  scheme  is  the  step- 
size  adaptation  strategy  which  utilizes  one-word  step  size 
memoryt^J.  This  technique  is  dealt  with  in  great  depth  in 
recent  literature  and  it  will  not  be  treated  in  detail  here. 
Basically,  a  B-bit  quantizer  (where  B  is  an  integer)  is 
employed  to  encode  the  input  sample  into  one  of  the  possible 

D 

2  LEVELS.  Then  the  quantizer  updates  its  step  size  after 
every  input  sample.  In  general,  these  quantizers  are  referred 
to  as  mid-rise  quantizers  and  an  example  is  a  3-bit 
quantizer  whose  characteristic  is  shown  in  Figure  2-5(a). 

Also,  multiplication  factors  for  2,  3,  and  4-blt  quantizers 
are  listed  in  Table  2-3. 

In  practice,  minimum  (Amin)  and  maximum  (Amax)  values 
have  to  be  Imposed  on  the  step  sizes  to  assure  proper 
quantizer  adaptation.  Hence,  an  obvious  disadvantage  of 
mid-rise  quantizers  is  that  it  cannot  represent  a  zero  level. 
For  small  amplitude  inputs,  the  quantized  values  always 
oscillate  between  +Amln  which  are  perceived  as  a  low  level 
tone.  In  the  case  of  subband  coders,  this  low  level  tone 
will  be  frequency  translated  back  in  the  middle  of  the 
speech  band  where  they  are  particularly  noticeable.  Hence, 
even  with  a  small  Amin,  the  distortion  is  still  very  dis¬ 
turbing. 

One  method  to  alleviate  the  low  volume  tones  is  to 
make  use  of  the  mid-tread  quantizer  An  example  is  the 

7-level  quantizer  whose  characteristic  is  shown  in  Figure 
2-5(b).  Since  the  quantizer  has  a  zeroth  step,  all  low  level 
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Inputs  are  then  coded  to  be  zero  and  this  eliminates  the  tones. 
Hence,  mid-tread  quantizers  seem  advantageous  for  subband 
coders.  However,  in  contrast  to  a  mid-rise  one,  a  B-bit  mid- 

g 

tread  quantizer  has  less  than  2  levels  and  this  represents 
a  decrease  in  coding  efficiency.  For  subband  coders,  mid¬ 
tread  quantizers  can  still  be  employed  without  losing  much  effi¬ 
ciency.  This  is  accomplished  by  utilizing  a  mid-rise/mid¬ 
tread  dequantizer  switch  at  the  receiver.  In  this  case,  the 
transmitter  employs  conventional  B-bit  mid-rise  quantizers 
whereas  the  receiver  can  select  either  mid-tread  or  mid-rise 
to  dequantize  the  data.  As  a  matter  of  fact,  a  step-size 
threshold  value  (Ath)  is  setup  in  such  a  way  that  a  mid-tread 
dequantizer  is  selected  if  A  <  Ath  whereas  the  mid-rise 
dequantizer  is  utilized  for  A  >  Ath.  Threshold  values  for  the 
9.6  and  16.0  Kb/s  subband  coders  are  experimentally  determined 
and  they  are  Included  in  Tables  2-1  and  2-2. 


STEP  SIZE 

MIIT  TTPT.TPATTON 

QUANTIZER  E 

ITS 

FACTOR 

3 

2 

Ml 

0.9 

0.85 

0.85 

M  2 

0.9 

1.0 

1.9 

M  3 

0.9 

1.0 

M  4 

0.9 

1.5 

M  5 

1.2 

M  6 

1.6 

M  / 

2.0 

M  8 

2.4 

TABLE  2-3  APCM  CODER  PARAMETERS 


2.3  Practical  Considerations  Inthe  Implementation  of  Subband 
Coders 

The  subband  coders  as  discussed  in  Section  2.2  employ  a 
bank  of  ^  or  5  FIR  filters  to  perform  bandpass  filtering.  One 


obvious  advantage  is  due  to  the  linearity  of  the  FIR  filter’s 
phase  response  with  respect  to  frequency  which  eliminates  dis¬ 
tortions  Introduced  by  time  delays  of  different  subbands.  How¬ 
ever,  to  achieve  the  desired  bandpass  characteristics,  length  of 
FIR  filters  is  generally  long  (e.g.,  200  taps)  thus  increasing  the 
processing  requirements.  In  the  real-time  implementations  of  sub¬ 
band  coders,  the  success  is  greatly  dependent  on  the  efficient 
programming  of  the  bandpass  filtering  operation  and  the  following 
sections  consider  in  more  detail  the  number  of  mathematical  opera¬ 
tions  (in  particular,  the  number  of  multiplies)  required. 

2.3.1  Decimation  Filtering 

The  ou’-puts  of  a  bandpass  filter  is  given  by  the  convolu¬ 
tion-sum 


N-1 

y(n)sT*  h(k)x(n-k)  (2-10) 

k-0 

where  N  is  the  order  of  the  filter,  h(k)  is  the  k^^  filter  co¬ 
efficient,  and  x(n),  y(n)  represent  the  input,  output  sample, 
respectively.  It  is  clear  from  Eq.  (2-10)  that  N  multiplications 
are  needed  to  compute  each  y(n).  However,  the  integer-band  sampl¬ 
ing  technique  calls  for  the  down-sampling  of  the  filtered  output 

t  h 

to  obtain  y^(n).  In  other  words,  only  every  point  has  to  be 
calculated  where  M  is  the  decimation  ratio  and  Eq .  (2-10  )  can  be 
re-written  as: 


N-1 

yj^(n)  =  y(Mn)  =  ^  h(k)x(Mn-k)  (2-11) 

k=o 

Henceforth,  the  total  number  of  multiplies  required  for  each  input 
sample  becomes  N/M.  Furthermore,  since  the  Impulse  response  of  a 
FIR  filter  satisfies: 


h(n)  “  h(N-l-n) 


{2-12) 


Kq.  can  be  shown  as  (for  N  even): 

N 

yj^(n)  =»y(Mn)  ■  h(k)  ^  x(Mn-k)  +  x(Mn-N+k  +  l)^  (2-13) 

k*o 

Then  the  symmetry  of  the  filter  reduces  the  number  of  multiplies 
by  50$  (or  N/2M  per  input  sample).  The  above  derivation  can  also 
be  applied  to  the  case  when  N  Is  odd.  Figure  2-0  Illustrates  the 
structure  that  e xplolts  the  symmetry  property  of  the  FIR  filters. 
Table  2- 4  summarizes  the  number  of  multiplies  needed  for  each  band 
to  perform  decimation  filtering  In  the  9.6  and  l6.0  Kb/s  subbatul 
coders . 

2.3.2  Interpolation  Filtering 

There  are  a  number  of  methods  which  may  be  used  to  perform 
interpolation  filtering.  The  most  straightforward  method  Is  to 
pad  in  (M-1)  zeros  between  each  decimated  sample  value  where  M  In 
this  case  the  interpolation  ratio,  and  bandpass  filter  the  result¬ 
ing  sequence.  Using  this  technique,  the  filter  structure  as  shown 
In  Figure  2-6  is  also  capable  of  performing  interpola¬ 
tion  filtering.  Though  the  symmetry  of  linear  phase  FIR  filters 
can  be  employed  to  decrease  the  number  of  multiplies,  the  amount 
of  multiplications  required  (N/2  per  Input  sample)  Is  still 
excessive.  However,  cognizant  of  the  fact  that  there  are  no 
multiplications  Involved  with  samples  of  zero  values,  this  numbei’ 
can  be  greatly  reduced.  Referi-lng  to  Figure  2-2(a),  the  synthesized 
sample  ^(n)  Is  shown  as 

N-1 

y(n)  -  h(k)y(n-k)  (2-l't> 

k«o 
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4 

#  OF  MULTIPLIES 

«  OF  MULTIPLIES 

PER  FRAME 

PER  FRAME 

IN  9.6  KB/S 

IN  16.0  KB/S 

SUBBAND  « 

SYSTEM 

SYSTEM 

1 

900 

300 

- 

2 

1800 

500 

3 

2000 

900 

•m 

4 

3600 

1800 

5 

1800 

- 

TOTAL 

8300 

5300 

4 


TABLE  2-4  NUMBER  OF  MULTIPLIES  PER 

FRAME  FOR  DIGITAL  DECIMATION 


4 
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Substituting  Eq .  (2-7)  Into  (2-li<),  y(n)  can  be  rewritten  as 
N-1 

'  E  h(k)y,  r  !t!il  (j-15) 

k-o  L  M  J 

where 

(5;  If  i  Is  an  Integer 

(2-16) 

0;  otherwise 

Using  this  technique,  the  number  of  multiplies  required  per  Input 
sample  as  shown  In  Eq .  (2-15)  is  equal  to  the  integer  quotient 
f N/M) .  The  total  number  of  multiplies  per  frame  for  Interpolation 
fllterlne  In  the  subband  coders  is  shown  In  Table  2-5. 

2.3«3  A  Novel  Technique  In  Digital  Interpolation 

If  Table  2- 5  Is  compared  to  Table  2-^,  It  is  shown  that  the 
Interpolation  filtering  requires  twice  the  number  of  multiplies  as 
that  of  decimation  filtering.  This  can  be  attributed  to  the  fact 
that  the  symmetry  of  FIR  filters  cannot  be  exploited  In  the  Inter¬ 
polation  case  M.  However,  by  rearranging  and  combining  some  of 
the  Input  samples,  the  symmetry  of  the  filter  can  Indeed  be 
emnloved  to  reduce  the  total  number  of  multiplies  at  the  expense 
of  more  additions  (subtractions). 

Defining 

(?-17) 

where  {•}  Is  the  Integer  quotient,  N  Is  the  filter  length  and  M  Is 
the  decimation  ratio,  the  remainder  a  can  be  written  as: 

01  »  N-1  -  mM  (2-l8) 
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SUBBAND  § 

#  OF  MULTIPLIES 

PER  FRAME 

REQUIRED  IN  9.6  KB/S 
SYSTEM 

#  OF  MULTIPLIES 

PER  FRAME 

REQUIRED  IN  16.0  KB/S 
SYSTEM 

1 

1,800 

630 

2 

3,600 

1,080 

3 

i<,000 

1,800 

4 

7,200 

3,600 

5 

3,600 

TOTAL 

16,600 

10,710 

TABLE  2-5  NUMBER  OP  MULTIPLIES  PER 
FRAME  FOR  INTERPOLATION 
FILTERING 
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Then  Eq.  (2-15) >  flrst^M  samples,  can  be  expanded  as: 


Making  use  of  the  symmetry  of  h(n)  as  defined  in  Eq.  (2-12), 

'^1>V\^=  ^  U') 

(2-20) 

ilo  "|=  ^VN-v) 

Substituting  Eq.  (2-20)  into  (2-19),  the  following  equations  are 


2-a 


resulted : 


I 

1 

2 

a 

3 


4^- 


I 


l^i<)  -  ^^10)  -V  ^  -^Uf  »^-'')  ^a-vA^ 

# 

I 

VJ,^>^^^MV^^.^Ar*^'^,^-l'^V  ■  •  ■  +  ■^10)^, (-m’) 

(2-21) 


•  » 


i 

#» 


t 

4i» 


I 

I 

1 


A  closer  examination  of  Eq.  (2-21)  reveals  great  similarity  between 
sample  pairs  y(0)  and  y(a);  y(l)  and  y(a-l)  ...  In  other  words, 
if  they  are  combined,  the  following  is  true: 


2-22 


K*.\  =  . 


(2-22) 


where 


denotes 


the  largest 


Integer  of 


Hence  from  Eq.  (2-22), 


Uo-yr  Vo 
KJ^\\  -  (Xy  V  V  ^ 


(2-23) 

y-i)  =  a^- 

-  (Xo-  bt> 

Since  it  only  takes  +1  multiplies  to  obtain  a^  or  b^,  the 

total  number  of  multiplies  for  each  y(k)  is  also  +1  which 

is  about  half  the  number  required  by  the  conventional  method  as 
''.hown  in  Eq.  (2-15).  Similarly,  y(a+l),  y(M-l);  y(a+2),  y(M-2); 

...  can  be  combined  in  the  same  fashion  yielding: 


2-23 


aiiu 

y*'il  = 

^  \\\^\  - 

As  shown  in  Eq.  (2-25),  it  takes  multiplies  to  compute  each 

y(k)  where  k  =  a+1,  a+2,  ...  M-1  whlcn  is  also  half  the  number  re¬ 
quired  by  the  conventional  way.  Then  the  total  number  of  multiplies 
needed  to  compute  all  M  points  is  given  by 

•'  (2-2ci) 

^  +  UM') 

As  an  example,  the  filter  structure  needed  to  compute  a  ,  b  (or 

^  o  o 

equivalently  y(0),  y(ot))  is  depicted  in  Figure  2-7.  To  compute 

/>  A 

a^,  b^  (or  y(l),  y(a-l),  the  same  filter  structure  with  a  different 
coefficient  set  can  be  utilized. 

Furthermore,  to  compute  a  new  block  of  output  data,  y(k)*s 
can  be  expanded  as: 


^lM=Vv\y)  V  •  • 


•  .  .  +KltA-')^  ,l"M+2.^ 

•  (2-27) 


By  comparing  Eqs.  (2-27)  with  (2-19),  it  is  clear  that  both  sets 
are  identical  with  the  only  exception  that  the  input  sequence  is 
incremented  by  1..  Hence  the  filter  structure  as  shown  in  Figure 
2-7  can  be  used  with  the  input  y(l)  to  the  filter.  Repeating  the 

i  •  A 

above  procedure,  all  interpolation  outputs  y(0),  y(l),  ...  can 
be  obtained.  To  show  the  efficiency  of  the  new  algorithm,  the 
number  of  multiplications  required  for  the  9-d  and  l6.0  Kb/s 
subband  coders,  summarized  in  Table  2-6,  shows  that  only  half  the 
'  number  of  multiplies  is  needed  as  compared  to  that  of  the  conven¬ 
tional  technique.  Furthermore,  utilizing  this  method,  the  mulipli- 
cation  necessary  for  both  digital  decimation  and  Interpolation  are 
roughly  equivalent. 
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TABLE  2-6  NUMBER  OF  MULTIPLIES  PER 
FRAME  FOR  DIGITAL  INTER¬ 
POLATION  WITH  NEW  TECHNIQUE 


2.4  Alternative  Subband  Coding  Techniques 

From  previous  sections,  it  has  been  shown  that  subband  coder 
is  a  conceptlonally  simple  speech  encoding  scheme  which  deals  ma^inly 
with  the  partitioning  of  input  speech  band  into  different  fre¬ 
quency  bands.  As  detailed  in  Section  2.2,  one  approach  is  to  make 
use  of  finite-impulse  response  (FIR)  filters  to  perform  the  band¬ 
pass  fl Itering  0 peratlon .  However,  to  achieve  the  necessary 
bandnass  characteristics,  it  is  shown  that  large  order  filters  are 
generally  needed  and  this  increases  the  processing  burden.  More¬ 
over,  the  combined  frequency  response  of  the  filter  bank  may  not 
be  desirable  as  a  result  of  the  spectral  gaps  as  depicted  in 
Figures  2-3  and  2-4.  Hence,  further  improvements  on  performance 
or  simplifications  on  the  implementation  of  subband  coders  hinge 
on  the  utilization  of  other  bandpass  filters. 

2.4.1  The  Recursive  Filter  Approach 

Since  low  order  recursive  filters,  somtlmes  referred  to  as 
infinite-impulse  response  (HR)  filters,  are  known  to  yield  magni¬ 
tude  response  comparable  to  that  of  long  FIR  ones,  it  would  seem 
feasible  to  use  HR  filters  in  performing  the  bandpass  filtering 
procedures  in  the  subband  coder.  Unfortunately,  nonuniform  group 
delay  due  to  the  nonlinearity  of  HR  filter's  phase  response  is 
rather  undesirable  for  this  application.  Furtherm.ore ,  the  savings 
on  computations  (mainly  multiplications)  realized  through  tb.e  use 
of  a  short  filter  length  is  minimal  because  of  HR  filter's  ineffi¬ 
ciency  in  performing  digital  interpolations  and  decimations. 

To  illustrate  the  latter  statement,  the  use  of  an  HR  filter 
to  downsample  input  samples  is  considered.  If  the  Z-transform  of 
the  HR  filter  is: 


2-28 


“z 

V--0  i 


(?-?8) 


where  Is  defined  as  the  order  of  the  filter  (N^  <N),  the  rela¬ 
tionship  between  the  input  and  the  filtered  output  sample  y(n)  as 
depicted  in  Figure  2-2(a)  is; 


=  - 1  t, ^ 


(P-29) 


Then  the  downsampled  version  of  ,y(n)  is  given  by: 


f' 

where  M  is  the  decimation  ratio. 


(P-^0) 


It  is  clear  from  Eq .  (2-30)  that  in  order  to  obtain  y^(n), 
2N'-1  multiplications  have  to  be  performed  for  every  y(n).  So,  In 
contrast  to  the  use  of  FIR  filters,  the  amount  of  multiplications 
required  in  this  case  does  not  decrease  with  M.  The  use  of  HR 
filters  in  subband  coding  is  only  advantageous  over  FIR  ones  If: 


Sn'-I  ^  '^/3L^A 


(P-31) 


7/-k(^«v^') 

As  the  value  of  M  Increases,  Eq .  (2-31)  is  less  likely  to 
be  true  and  this  is  one  of  the  reasons  that  IIH  filters  may  not 
be  applicable  for  subband  coding  of  speech  signals. 


*Note:  In  a  recent  paper,  the  design  of  a  special  class  of  HR 

filters  is  given  which  can  perform  efficient  digital  decimation 
However,  the  nonlinear  phase  characteristic  of  HR  filters  is  still 
undesirable  for  subband  coding. 
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2  .  . 2  The  Quadrature  Mirror  Filter  Approach 


A  better  approach  to  subband  coding  Is  to  make  use  oT 
quadrature  mirror  filters  (QMF)  which  band-spl 1 I s/reconslructs 
the  Input  waveform  via  declmatlon/lnterpolatlon  methods  [/’I  It, 
will  be  shown  later  ttian  the  use  of  QMF  design  equations  will 
achieve  perfect  splitting/reconstruction  without  large-order 
f 1  Iters . 


For  explanatory  purposes,  consider  the  Ideal  splitting/ 
recotistruct  1  on  pi’ooess  described  In  Figure  2-8.  Foi’  this  systt'm, 
the  following  definitions  apply: 

a.  x(n)  Is  a  Nyqulst  band-limited  signal  with  ;''.-t  ransfoi'iii 
X(?.). 

b.  h^  Is  the  Impulse  response  of  the  low-pass  filter  and 
the  s-transform  of  which  Is  H^(n). 

c.  h.,(n)  Is  the  Impulse  response  of  the  high-pass  filter 

the  s-transform  of  which  la  ). 

d.  y^(n)  Is  a  baseband  equivalent  low-pass  signal  with 
s-transform 

e.  y,, (n)  Is  a  baseband  equivalent  high-pass  signal  with 
7,-transform  Y^iz). 

i- 

The  signal  x(n),  Is  processed  by  filters  h,j(n)  and  h,!!!! 
yielding  the  low-pass  and  high-pass  equivalents,  Xj(n)  and  x,!::!, 
of  the  Input  signal.  As  their  spectra  occupy  half  the  N.vuiulst 
bandwidth  of  the  original  signal,  the  sampling  rati'  In  each  band 
can  be  halved  by  decimating  (Ignoring)  every  second  sample.  For 
reconstruction,  the  signals  y,(n)  and  y,^(n)  are  Interpolat  ed ,  by 
Inserting  one  zero-valued  sample  every  other  time,  and  then  filtered 
respect  1  velyby  h,(n)  and  h,,(n)  before  being  added,  to  give  t  lie  stg- 
nal  x(n).  The  dashed  lines,  shown  in  Figure  8 ,  represent  the  data 
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passed  to  the  communication  channel(s)  by  the  speech  processing 
system . 

A 

In  order  to  minimize  (x(n)  -  x(n)),  certain  restrictions 
on  the  filters,  h^(n)  and  h2(n),  must  be  met.  We  will  derive 
these  restrictions  by  contructing  the  transfer  function  of  the 
QMF  structure. 

Using  z-transform  notation  and  referring  to  Figure  2-8, 
we  may  write  the  intermediary  filtered  output  as 

(2-3?) 

and 

Xai^  =  (2-33) 

The  transforms  of  the  decimated  signals,  y^(n)  and  y^Cn),  and  of 


the  interpolated  signals,  u^(n)  and  U2(n),  are 

given  by; 

(2-3^) 

(2-35) 

(2-3b) 

(2-37) 

After  the  final  filtering  operating,  the  transforms  of  the  recon¬ 
structed  waveform  components,  tj^(n)  and  t^Cn),  are  given  by 

T'lUV'Ad?:')  0,(1) 

(2-38) 

(2-3'-'’) 

Using  the  relations  expressed  in  (2-3^)  through  (2-39),  the  z-t rans- 
forms  can  be  rewritten  as 
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1 


Tv  UV  i  \Av(i)  (2-110) 

The  z-transform  of  the  reconstructed  waveform,  x(n)  is  obtained  by 
adding  (2-40)  eind  (2-4l) 

If  we  assume  that 

then  the  reconstructed  waveform  transform  becomes 

Evaluating  z  on  the  unit  circle  gives  the  Fourier  transform  of 
X(z) 

For  the  case  when  h^(n)  is  an  even,  symmetrical  FIH  filter  of  order 
N,  then  it  can  be  shown  that  (2-45)  reduces  to 

2  1  w  t  w 

where  (e  )  exhibits  an  odd  symmetric  property  about  s/4  and 

the  half-power  point  H^^('^s/4)  =  0.5. 

The  inverse  transform  yields  a  perfectly  reconstructed  sig¬ 
nal  (no  frequency  distortion)  with  a  gain  factor  of  1/2  and  delay 
of  N-1  samplesas  shown  by 


(2-47) 


Therefore,  we  have  shown  that  to  guarantee  perfect  reconstruction 
of  the  original  spectrum,  the  following  filter  constraints  must  be 
satisfied 

2. 4.2.1  Subband  Coding  via  QMF  Filters 

Employing  a  procedure  similar  to  the  Hermann  design  algorithm 
for  maximally  flat  FIR  filters,  a  60-tap  filter  which  satisfies  the 
QMF  constraints  as  stated  in  Eqs .  (2-48)  to  (2-50)  is  obtained 

Its  impulse  response  h^(n)  is  tabulated  in  Appendix  C  and  the  cor¬ 
responding  magnitude  response  as  depicted  in  Figure  2-9  shows  that 
the  filter  has  aflat  passband  extending  from  0  -  f  /4  Hz.  It  is 
also  characterized  by  a  fast  roll-off  with  the  3-db  point  located 
at  From  h^(n),  its  quadrature  mirror  counterpart  ^^(n)  is 

generated  using  Eq.  (2-49). 

Starting  with  these  two  filters  h^(n)  and  h2(n)  ,  a  new  form 
of  subband  coder  is  realized.  As  a  matter  of  fact,  a  3-stage  tree 
structure  as  shown  in  Figure  2-10  is  exploited  to  split  the  input 
speech  band  into  8  subbands  [10]  .  For  an  input  signal  of  3200  Hz 
bandwidth,  subbands  of  400  Hz  are  resulted.  At  the  first  stage  of 
the  transmitter,  the  QMF  filters  split  the  input  into  two  bands; 
that  is,  0-1600  Hz  and  I60O-32OO  Hz.  Then  a  downsampling  procedure 
is  utilized  to  reduce  the  number  of  samples  by  a  half.  At  the 
second  stage,  the  identical  bandsplitting  process  is  applied  to 
each  subband.  Consequently,  2  more  subbands  are  generated  each  of 
which  has  a  800  Hz  bandwidtn.  At  the  end  of  the  second  stage,  a 
total  of  4  subbands  is  obtained.  The  method  is  repeated  one  more 
time  and  eight  subbands  spanning  frequency  range  from  0  to  3200  Hz 


(2-48) 

(2-49) 

(2-50) 
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are  created.  After  performing  the  bandsplitting  process,  the 
subband  signals  are  Individually  quantized  for  transml sslon .  At 

the  receiver,  the  reverse  operation  which  Includes  up-sampling 
and  bandpass  filtering  Is  performed.  They  are  later  recombined 
to  form  an  estimate  of  the  input  signal.  So,  depending  on  tlie 
number  of  bits  utilized  In  the  quantization  of  the  subbands, 
both  the  9.6  and  Ib.O  Kb/s  subband  coders  can  be  realized. 

2. ^.2. 2  Adaptive  Bit  Allocations 

One  of  the  advantages  of  subband  coding  algorithms  is  tlie 
possibility  of  using  the  spectral  densities  of  the  suband  signals 
and  applying  nonuniform  quantization  to  each  one.  The  conventional 
approach  Is  to  use  a  flxed-blt  allocation  rule  where  a  pre-determined 
number  of  bits  is  assigned  to  the  coding  of  each  band.  In  the  QMF 
system,  a  larger  proportion  of  the  available  bits  will  be  allocated 
to  the  quantization  of  the  low  bands  In  order  to  capture  all  pitch 
and  formant  Information,  whereas  a  smaller  percentage  of  the  bits 
will  be  utilized  foi’ the  high  bands.  However,  since  the  ratio  of 
low-band  and  hlgh-band  energies  fluctuates  from  frame  to  frame, 
fixed-bit  allocation  may  not  necessarily  be  the  best  strategy. 
Instead,  an  adaptive  bit  allocation  scheme  that  dynamically  alters 
the  bit  assignments  depending  on  the  signal  energies  of  the  eight 
bands  seems  more  applicable  [nl . 

For  explanation  purposes,  let  us  assume  that  the  average  bit 
rate  to  quantize  all  subbands  Is  R  bits.  In  other  woi’ds,  If  h\  Is 
the  actual  number  of  bits  allocated  to  the  1^'  subband,  K  Is  given 
by  : 


The  nnmhpr  o^  bits  neo'^ed  to  nuantlze  the  Itli  subband  is  given  by 


I 

I 

I 

I 

1 

J 

J 

I 


^  . 


I. 


^  c 


c-S 
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,th 


subbancl,  and  the  conre- 


2  2 

where  o„  ,  a  denote  the  power  of  the  1 
<’<1  Qi 

spending  quantization  noise  power,  respectively,  and  5  is  a  cor¬ 
rection  term  which  depends  on  the  performance  of  practical  quan¬ 
tizers.  If  the  dependence  of  on  5  is  neglected.  Equation  (2-52) 
can  be  rewritten  as: 


Z  I  -16; 

Defining  the  total  quantization  noise  power  E  as 


% 


(2-53) 


(2-5^) 


adaptive  bit  allocation  calls  for  the  selection  of  for  each 
subband  such  that  E  is  minimized  provided  the  constraint  as  given 
in  Equation  (2-51)  is  satisfied.  This  minimization  procedure  can 
be  best  performed  with  the  help  of  Lagrange  multipliers  where  a 
combined  performance  index  I,  defined  as: 


1=  e+aG- 


(.’-55) 


with  X  given  as  the  Lagrange  multiplier  and  G  the  constraint 
function,  is  minimized.  G  is  defined  as: 


% 


i-\ 


(2-5b) 


Differentiating  I  with  respect  to  R^^  and  X,  and  setting  the  result¬ 
ing  equations  to  zero,  the  following  is  obtained. 
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2. 

\  -V  >  -  o 


(2-57) 


Combining  Eq.(2-57)  for  i=l,  2,  ...  8, 


'  (T.  a 


(2-58) 


Taking  logarithm  on  both  sides,  substituting  Equation  (2-57)  and 
eliminating  A,  Equation  (2-58)  becomes 


•*•1  Xj  / 


(2-59) 


Hence,  bits  to  encode  each  subband  can  be  adaptively  allocated  ac¬ 
cording  to  the  corresponding  signal  power  within  each  band. 
Furthermore,  zero  bits  are  assigned  whenever  as  obtained  from 
Eq .  (2-59)  are  negative.  To  prevent  the  allocation  of  all  tne 
bits  in  one  subband,  a  maximum  value  is  imposed  on  each  .  More¬ 
over,  the  frame  powers  for  the  eight  subbands  have  to  be  trans¬ 
mitted  so  that  the  receiver  can  decode  the  bits  via  an  identical 
scheme . 
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2.4.2. 3  Discussion  of  Results 

In  order  to  evaluate  the  performance  of  subband  coders 
using  QMF  filters,  simulations  of  the  system  are  programmed  in 
FORTRAN.  Specifications  of  the  9-6  and  l6.0  Kb/s  systems  are 
summarized  in  Table  2-7.  In  this  simulation,  a  3-stage  QMF 
filter  structure  as  shown  in  Figure  2-10  is  employed  xn  the 
bandsplitting  and  reconstruction.  With  a  filter  length  of  60 
taps,  the  delay  for  the  entire  system  is  70t(60-l)  or  413 
samples . 

To  examine  the  effect  of  the  filters,  the  system  is  imple¬ 
mented  without  quantization  and  Figure  2-11  depicts  the  overall 
frequency  response  of  the  filter  structure.  In  contrast  to  the 
integer-band  sampling  technique,  the  QMF  filters  span  the  entire 
speech  band  with  no  significant  gaps  in  the  spectrum 
and  this  results  in  the  "smoother"  quality  of  the  processed 
speech . 

To  further  improve  the  quality,  the  adaptive  bit  allocation 
algorithm  as  discussed  in  Section  2. 4. 2. 2  is  also  incorporated. 

In  order  to  prevent  the  allocation  of  too  many  bits  on  a  single 
subband , a  maximum  number  of  5  bits  is  allowed.  Moreover,  for  bands 
with  zeroth  bit  assignment,  low  level  random  noise  is  inserted  to 
eliminate  the  "low-pass"  effectC^S].  Though  the  resulting  speech 
is  slightly  noisier,  the  added  noise  seems  to  fill  the  spectral 
gaps  created  by  quantization  of  some  subbands  with  no  bits  and 
the  overall  quality  of  the  processed  speech  is  improved. 

Figures  2-12  and  2-13  show  frame-to-frame  slgnal-to-noisc 
plots  for  the  9.6  and  16.0  Kbps  QMF  subband  coder.  The  result 
indicates  that  the  QMF  systems  yield  relatively  high  signal-to- 
noise  ratios.  Also,  the  coders  seem  to  perform  the  best  when  the 
input  power  is  highest.  This  is  not  surprising  since  after  the 
bandpass  filtering,  the  subband  with  the  largest  average  amplitude 
IS  captured  with  the  most  bits  using  the  adaptive  bit  allocation 
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Sampling  Rate  =  6.4  KHz 
Frame  Length  =  144  samples 
No.  of  Frames/Sec.  =  44.4444 
Frame  Time  =  22.5  msec 

For’  the  16.0  Kb/s  system:  total  no.  of  bits  per  frame  to  quantize 

8  subbands  =  324;  Overhead  bits  per  frame  = 

For  the  9-6  Kb/s  system:  total  no.  of  bits  per  frame  to  quantize  8 

subbands  =  180;  Overhead  bits  per  frame  =  36 


SUBBAND  # 

FREQUENCY 
RANGE  IN  HZ 

DOWNSAMPLING 
(INTERPOLATION)  RATIO 

FOR  ALL  3  STAGES 

QUANTIZER 

BIT  ALLOCATIONS 

1 

0-  400 

8 

adaptive 

2 

400-  800 

8 

adaptive 

3 

800-1200 

8 

adaptive 

4 

1200-1600 

8 

adaptive 

5 

1600-2000 

8 

adaptive 

6 

2000-2400 

8 

adaptive 

7 

2400-2800 

8 

adaptive 

8 

2800-3200 

8 

adaptive 

TABLE  2-7  SPECIFICATIONS  OF  9 • 6  AND  16.0  KB/S 
QMF  SUBBAND  CODERS 


fREfUeNCy  (HZ>  0:j  0'  ■> 

120.00  160.00  ?>P.00  2ii0.00 


rule.  Informal  listening  tests  conducted  on  processed  sentences 
also  show  that  the  speech  quality  of  QMF  subband  coder  Is  im¬ 
proved  over  that  of  the  Integer-band  sampling  technique.  At 
9.6  Kb/s,  the  system  tends  to  produce  slightly  reverberent 
speech  and  however  at  16  Kb/s,  high  quality  speech  is  yielded. 
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SECTION  III 

The  Multiplier-Accumulator  Hardware 
3.1  General  Description 

In  this  contract,  GTE  Sylvania  developed  a  state-of-the-art 
multiplier-accumulator  (MULACC)  which  greatly  enhances  the  capa¬ 
bility  of  the  Sylvania  Programmable  Signal  Processor  (PSP)  in 
various  signal  processing  applications.  In  addition  to  linear 
convolutions,  the  hardware  performs  computations  of  auto-/cross- 
correlation  of  two  arrays  of  numbers  most  effectively.  The  nucleus 
of  the  design  is  a  high  speed  64-pin  multiplier-accumulator  chip 
(TRW  TDC  lOlOJ)  which  multiplies  two  16-bit  numbers  and  accumulates 
a  35-bit  product  in  115  nsec.  Moreover,  to  fully  exploit  its 
capability,  two  buffers  of  fast  RAM  memories  (MOSTEK  MK4118P)  are 
included  which  feed  data  directly  into  the  chip.  Since  the  cir¬ 
cuitry  communicates  with  the  PSP  CPU  through  input/output  buses, 
these  memory  buffers  drastically  reduce  the  passing  of  data 
between  the  PSP  and  MULACC.  Furthermore,  since  MULACC  is  treated 
as  a  peripheral,  no  hardware  changes  are  required  on  the  PSP  CPU 
and  all  existing  programs  are  still  operable  on  the  modified 
machine.  After  starting  MULACC,  the  PSP  CPU  is  also  free  to  per¬ 
form  other  tasks  and  this  represents  a  more  efficient  utilization 
of  available  processing  time.  As  an  indication  of  its  speed,  the 
MULACC  is  capable  of  accessing  the  data  from  the  two  buffers, 
multiplying  two  16-bit  numbers,  and  forming  a  35-bit  product 
accumulation  in  208  nsec  (or  2  PSP  cycles) . 

The  MULACC  hardware  as  shown  in  Figure  3-1  consists  of 
two  PC  cards.  The  first  card  (board  CON)  ^  located  at  slot  7  of 
the  PSP  nest,  is  responsible  for  interpreting  outputs 
from  the  PSP  CPU,  and  decoding  them  into  MULACC  instructions.  In 
particular,  outputs  from  channel  5  of  the  PSP  are  converted  into 
MULACC  instructions  whereas  outputs  from  channel  6  are  treated  as 
data.  Then  the  correct  calling  sequence  for  programming  MULACC 
becomes 

1)  OUT  5  INSTRUCTION 

2)  OUT  6  DATA 
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FI^-URE  3-1  BLOCK  DIAGRAM  OF  THE  HIGH  SPEED  MOLTIPLIER-ACCUMULATOr 


After  the  first  line  of  code,  the  instruction  reqister  of  MUIJ^CC 
is  loaded  and  the  corresponding  timing  and  control  information  are 
read  from  a  programmable  logic  array  (PLA) .  This  is  later  used 
to  setup  the  sequence  of  operations  required  by  the  instruction. 
From  the  second  line  of  code,  the  data  become  operands  of  the 
corresponding  memory  registers  and  loopcounters . 

The  second  PC  card  (board  ACC)  located  at  slot  5  of  the  PSP 
nest,  contains  the  two  buffer  memories  as  denoted  by  X  and  Y,  and 
the  multiplier-accumulator  chip.  Memory  address  registers  which 
are  also  present  on  board  ACC  can  be  programmed  to  point  to  any 
locations  within  the  buffers.  An  up/down  counter  is  utilized  to 
increment/decrement  the  address  registers  automatically  after  each 
memory  access.  In  the  present  design,  2K  16-bit  data  points  can 
be  pre-stored  in  X-buffer  whereas  IK  can  be  loaded  into  Y-buffer. 
After  MULACC  is  started  with  setting  of  the  loop  counter,  32-bit 
product  is  first  formed  which  is  then  added  to  the  content  of  the 
TRW  chip's  output  register.  Upon  the  termination  of  the  operation, 
a  LOAD  bit  is  set  to  a  1  and  the  35-bit  product  is  then  transferred 
to  the  PSP  input  channels  in  three  segments;  namely,  extended, 
most  significant,  and  least  significant  products. 

3.2  Programming  of  MULACC 

As  discussed  in  Section  3,1,  procedures  for  programming  MULACC 
include  the  outputting  of  an  instruction  code  from  channel  5  of 
the  PSP,  followed  by  an  output  of  data  from  channel  6.  From  the 
first  output,  the  instruction  register  of  MULACC  is  loaded  and  the 
second  output  supplies  the  operand  data.  MULACC  instructions, 
which  include  the  loading  of  X  and/or  Y  buffer  addresses  and 
memories,  the  selection  of  multiplier  functions,  and  loading  of 
multiplier  loop  counter  are  listed  in  Table  3-1. 

As  for  multiplier  functions,  the  hardware  permits  the  choice 
of  12  functions;  namely,  unsigned  magnitude/2 ' s  complement  arith¬ 
metic,  multiplication  with/without  rounding,  mul tipi ication  with/ 
without  accumulation,  multiplication  with/without  subtraction, 
increment/decrement  of  X-buffer  pointers,  and  increment/decrement 
of  Y-buffer  pointers.  Bit  settings  of  the  corresponding  functions 


TABLE 

*>  1  Ml|» 

or';  1  y  PfTT  JOuS 

INSTRUCTIONS 

HEX 

VALUES 

COMMENTS 

Idle  (IDLE) 

0 

An  output  of  value  0  to  channel  5  will 
set  the  MULACC  to  an  idle  state 

Load  X-Buffer  Address 
(XADR) 

1 

Outputs  of  value  1  to  channel  5,  followed 
by  the  number  XXXX  to  channel  6  will 
point  to  address  XXXX  of  X-Buffer 
(0<  XXXX<  2047). 

Load  Y-Buffer  Address 
(YADR) 

2 

Outputs  of  value  2  to  channel  5  followed 
by  the  number  YYYY  to  channel  6  will  point 
to  address  YYYY  of  Y-Buffer 
(0  £  YYYY  £  1023). 

— 

Load  Identical  X-Buffer 
and  Y-Buffer  addresses 
(ADR) 

3 

Outputs  of  value  3  to  channel  5,  followed 
by  the  number  XXXX  to  channel  6  will 
set  both  X  and  Y-Buffer  address  pointers 
to  XXXX. 

Load  X-Buffer  Memory 
(LDX) 

4 

Outputs  of  value  4  to  channel  5,  followed 
by  the  number  DDDD  to  channel  6  will  load 
data  DDDD  into  the  pointing  X-Buffer 
address  (see  Note  1  in  Table  3-4). 

Load  Y-Buffer  Memory 
(LDY) 

5 

Outputs  of  value  5  to  channel  5,  followed 
by  the  number  DDDD  to  channel  6  will  load 
data  DDDD  into  the  pointing  Y-Buffer 
address  (see  Note  1  in  Table  3-4). 

Load  Identical  Values 
into  X  and  Y-Buffer 

Memories  (LXAY) 

6 

Outputs  of  value  6  to  channel  5,  followed 
by  the  value  DDDD  to  channel  6  will  load 
data  DDDD  into  the  pointing  X  and  Y- 
Buffer  addresses  (see  Note  1  in  Table  3-4). 

Select  Functions  (TASK) 

7 

Outputs  of  value  7  to  channel  5  followed 
by  the  value  DDDD  to  channel  6  set  up  the 
MULACC  to  perform  task  DDDD.  (Refer  to 
table  3-2  for  specific  task  definitions.) 

Read  Multiplier  (READ) 

8 

An  Output  of  value  8  to  channel  5  followed 
by  a  waiting  period  of  4  cycles  will 
enable  the  products  presently  residing  at 
the  MULACC  to  be  read  (see  Note  2  and  refer 
to  Table  3-3  for  specific  reads). 

Load  Loop  Counter  A 

Start  Mul tipi ier 
(LOOP) 

9 

Ouputs  of  value  9  to  channel  5  followed 
by  the  number  (255-nnn)  to  channel  6  will 
set  up  the  MULACC  to  multiply  nnn  times. 

The  maximum  number  allowed  is  255. 

Immediately  after  the  setting  of  the  loop 
counter,  multiplication  begins. 

Load  different 
addresses  of  X  and  Y- 
Buffers  (XYADR) 

A 

This  is  a  short  cut  to  load  different  X 
and  Y-Buffer  addresses.  Outputs  of 
value  A  to  channel  5  followed  by  a  value 
of  XXXX  to  channel  6  and  a  value  of  YYYY 
to  channel  6  will  set  the  X  and  Y  address 
pointers  to  XXXX  and  YYYY,  respectively. 
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are  summarized  in  Table  3-2.  To  illustrate  this,  the  selection  of 
multiplication  with  accumulation  function  is  considered.  First,  a 
value  7  is  outputted  from  PSP  channel  5  to  MULACC  which  signifies 
the  choice  of  functions.  Then  an  output  of  (33)g  from  channel  6 
sets  up  MULACC  to  perform  2's  complement  arithmetic,  multiply  (with 
no  rounding)  with  accumulation,  and  increment  both  X  and  Y  buffer 
pointers  after  each  operation.  Hence,  by  resetting  the  buffer 
pointers,  the  multiplier  can  be  started  to  perform  multiplications 
with  accumulation  of  two  arrays  of  numbers. 

When  the  multiplier  is  finished,  the  load  (RDY)  bit  of  MUIJiCC 
will  be  set  to  1  and  the  results  can  be  transferred.  Since  the 
PSP  is  a  16-bit  machine,  the  35-bit  product  is  shipped  via  three 
input  channels;  namely,  the  extended  product  (XTP)  through  channel 
10,  the  most  significant  product  (MSP)  through  channel  9,  and  the 
least  significant  product  (LSP)  through  channel  8.  In  addition, 
the  RDY  bit  is  multiplexed  with  the  actual  XTP  during  the  transfer. 
So,  after  reading  the  XTP,  the  PSP  has  to  perform  a  masking 
followed  by  a  shift  right  once  operation  in  order  to  obtain  the 
correct  XTP. 

Besides  the  products,  input  channels  8  and  9  are  also  utilized 
to  read  the  X  and  Y  buffer  memories.  A  tri-state  device  gating  on 
the  multiplier  RDY  bit  is  used  to  switch  the  outputs  from  the 
multiplier  to  that  of  the  memories.  In  particular,  when  RDY  =  0, 
input  channels  8,  9  are  connected  to  that  of  Y,  X  buffers,  respec¬ 
tively.  On  the  other  hand,  if  RDY  =  1,  input  channels  8,  9,  10 
are  hooked  up  to  LSP,  MSP,  XTP.  PSP  instructions  required  to  read 
products  and  memories  are  shown  in  Table  3-3.  Also,  special  con¬ 
siderations  on  loading  and  reading  of  MULACC  buffers,  together  with 
multiplier  functions  are  discussed  in  Table  3-4, 

To  further  illustrate  the  utility  of  MULACC,  a  PSP  demonstra¬ 
tion  program  that  computes  the  following  operation  is  detailed  in 
Figure  3-2: 

199 

y(0)  =  Z  h(k)x(0-k)  (3-1) 

k=0 
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TABLE  3-2  MULTIPLIER  FUNCTIONS 


BIT  5 

BIT  4 

BIT  3 

BIT  2 

BIT  1 

BIT  0 

BIT 

SETTING 
=  0 

DY 

DX 

NO 

RND 

NO 

SUB 

NO 

ACC 

USM 

BIT 

SETTING 
=  1 

lY 

IX 

RND 

SUB 

_ 

ACC 

TC 

Symbol  Definitions: 

USM:  Unsigned  magnitude  arithmetic  (16-bit  with  no  sign  bit) 

TC  :  2‘s  complement  arithmetic  (15-bit  with  1  sign  bit) 

ACC:  product  accumulation  (ACC  =  Product  +  ACC) 

SUB:  product  subtraction  (If  bit  2=1  &  bit  1=1,  ACC  =  PRODUCT  -  ACC) 

IX  :  increment  X-buffer  pointer 

DX  :  decrement  X-buffer  pointer 

lY  :  increment  Y-buffer  pointer 

DY  :  decrement  Y-buffer  pointer 

RND:  rounding  (see  Note  3  on  Table  3-4) 
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TABLE  3-3  INSTRUCTIONS  TO  READ  BUTTER  MEMORIES  AND  MULIIPL.ER  PRODUCTS 


INSTRUCTIONS 


PSP  MNEMONICS 


COMMENTS 


To  read  contents  of 
X-Buf.er  (XREAD) 


INPA  9. 


After  setting  up  the  X-Buffer  address 
using  XADR,  its  content  can  be  read 
after  a  waiting  period  of  4  cycles 


To  read  contents  of 
Y-Buffer  (YREAD) 


INPA  8. 


After  setting  up  the  Y-Buffer  address 
using  YAOR,  its  content  can  be  read 
after  a  waiting  period  of  4  cycles 


To  read  the  content  of 
extended  product 
(XTPR) 


INPA  10. 


To  read  the  content  of 
the  most  significant 
product  (MSPR) 


INPA  9. 


Following  either  a  load  loop  counter 
(LOOP)  or  read  multiplier  (READ) 
instruction,  the  extended  product  (XTP) 
can  be  accessed.  Only  the  lower  four 
bits  are  of  interest  and  the  upper  12 
bits  should  be  masked  out.  Its  format 
is  as  follows: 


I 

15 

3  2  1 

D 

LAII  I's- 

« - XTP — * 

.•-RDY*| 

where 

RDY=0:  multiplication  has  not 
been  completed 

RDY=1:  multiplication  has 
been  completed 

The  XTP  is  meaningful  only  if  RDY=1 


Following  either  a  load  loop  counter 
(LOOP)  or  a  read  multiplier  (READ) 
instruction,  the  most  significant 
product  (MSP)  can  be  read.  However 
the  MSP  value  is  valid  only  after  the 
multiplication  has  been  completed 
(i.e.,  RDY=1) 


To  read  the  content  of 
the  least  significant 
product  (LSPR) 


INPA  8. 


Following  either  a  load  loop  counter 
(LOOP)  or  a  read  multiplier  (READ) 
instruction,  the  least  significant 
product  (LSP)  can  be  read.  The  LSP 
value  is  valid  only  after  the  multi¬ 
plication  has  been  completed 
(i.e.,  RDY=1) 


TABLE  3-4 


NOTES  ON  PROGRAMMING  THE  MULACC 


1 .  Loading  of  X  and/or  Y-Buffer  Memories 

Since  the  loading  of  buffer  memories  takes  tnore  than  two  cycles,  a  4-cycle 
wait  period  is  needed  to  assure  the  success  of  writing  consecutive  memory  locations. 

2.  Reading  of  Buffer  Memories  and  Multiplier  Products 

The  MULACC  functions  include  the  readings  of  buffer  memories  and  the 
three  multiolier  products.  In  executing  instructions  that  do  not  involve  the 
multiplier,  the  product  outputs  are  disenabled  from  the  tri-state  control  and 
the  bus  is  connected  to  the  buffer  memories  which  allows  the  reading  of  their 
contents.  At  this  time,  the  multiplier  ready  bit  is  reset  (RDY  =  0).  How¬ 
ever,  by  outputting  a  read  multiplier  (READ)  instruction  on  channel  5,  the 
tri-state  bus  is  switched  to  the  multiplier  outputs.  This  set  the  RDY  bit  to 
1  and  enables  the  reading  of  existing  multiplier  products. 

In  performing  operations  that  require  the  multiplier,  the  RDY  bit,  ori¬ 
ginally  set  to  0,  will  change  to  a  1  inmediately  upon  their  completion.  At 
this  time,  the  three  products  will  be  ready  to  be  transferred  to  the  PSP 
using  instructions  shown  in  Table  3-2. 

3.  Multiply  with  Rounding 

The  TRW  chip  has  a  multiply  with  or  with  no  rounding  option.  The  multiply 
with  rounding  is  performed  by  adding  a  1  to  bit  15  of  the  least  significant 
product  and  the  rounded  result  is  obtained  by  reading  the  extended  and  most 
significant  products.  This  option  is  generally  not  recommended  for  multipli¬ 
cation  together  with  accumulation  since  it  yields  erroneous  results.  To  further 
illustrate  this,  the  multiplication  with  rounding  and  accumulation  of  two  arrays 
of  O's  shows  a  non-zero  final  value. 
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Block^of  the  program  chooses  the  multiplier  functions  whereas 
Block @ indicates  the  loading  of  the  X  and  Y  buffer  memories.  As 
shown  in  the  figure,  the  starting  addresses  of  the  buffers  are 
first  set  and  then  each  MULACC  buffer  location  can  be  individu¬ 
ally  loaded  from  PSP  memories.  By  resetting  the  buffer  pointers  (X  pointing 
to  h(0),  Y  pointing  to  x(0)),  the  loop  counter  is  initialized  to  be  zOO 
and  multiplication  with  accumulation  is  started  as  shown  in  Block 
.  The  multiplier  RDY  bit  is  constantly  checked  and  results  are 
transferred  to  PSP  as  illustrated  in  Block  O- 
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FIGURE  3-1  A  PSP  PROGRAM  TO  DEMONSTRATE  THE  UTILITY 
OF  MU L ACC 


SECTION  IV 
Real-Time  Software 

4,1  Introduction 

The  real-time  subband  coder  proqrams,  operating  at  the  data 
rates  of  9.6  and  16.0  Kb/s,  perform  subband  coding  of  speech  sig¬ 
nals  via  the  integer-band  sampling  technique  [3] .  Exploiting  a 
hich  speed  multiplier-accumulator  (MULACC) ,  the  9.6  Kb/s  program 
runs  in  a  full-duplex  mode  whereas  the  16.0  Kb/s  program 

operates  in  half-duplex  on  the  PSP.  A  summary  of  the  real-time 
9.6  and  16.0  Kb/s  subband  coder  specifications  is  shown  in  Tables 
2-1  and  2-2.  The  functions  in  the  real-time  program  include  speech/line 
side  interrupt  processing;  receiver/transmitter  synchronization; 
decimation/interpolation  filtering;  and  quantization/dequantiza¬ 
tion  of  subband  waveforms. 

Under  t''.c  speech  side  interrupt  control,  speech  samples  are 
brought  into  the  PSP  via  the  A/D  converter  and,  concurrently, 
processed  speech  is  outputted  through  the  D/A,  After  subband 
filtering  and  encoding  of  the  input  signal,  the  binary  bit  stream 
is  then  transferred  from  one  PSP  to  the  other  using  the  line  side 
interrupt . 

Within  each  frame  of  transmitted  data,  a  synchronization 
(SYNC)  bit  is  inserted  at  the  same  position  of  the  frame.  Through 
an  extensive  handshaking  procedure,  the  receiver  synchronizes 
with  the  transmitter  by  determining  the  position  of  the  SYNC  bit 
and  eventually  aligns  the  received  data  buffer  to  that  of  the 
transmitted  one.  In  the  real-time  programs,  the  sync  bit  is  sot 
alternatively  to  a  1  or  0  to  assure  a  more  reliable  synchroniza¬ 
tion  between  the  two  PSP's. 

A  major  portion  of  the  subband  coder's  processing  load  lies 
on  the  decimation  and  interpolation  filtering  in  the  main  line 
code.  At  the  transmitter,  this  involves  the  filtering  of  digitized 
speech  samples  using  a  bank  of  4  or  5  f inite-impulse-responsc 
(FIR)  filters^each  of  which  is  200-tap  long|  and  the  downsampling 
(decimating)  of  the  output.  These  decimated  subband  signals  are 
then  quantized  for  transmission.  At  the  receiver,  the  trans¬ 
mitted  values  are  dequantized,  up-sampled  (interpolated),  and 
filtered  through  the  same  bank  of  filters.  The  outputs  are  later 
combined  to  form  an  estimate  of  the  inv>ut  signal.  The  following 
sections  describe  in  more  detail  the  operation  of  each  program 
f  unct ion . 
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•1  •  ^  Speech,  Line  Side  Interrupts 

The  speech  side  interrupt  is  responsible  for  transferring 
speech  data  into  and  out  of  the  PSP's.  Its  interrupt  rate  is 
basically  controlled  by  a  programmable  real-time  counter  (OOOF  i  <, ) 
and  an  I-O  clock  whose  frequency  is  11.52  MHz.  Dividing  the 
frequency  of  the  clock  by  the  setting  of  the  counter  yields  the 
d  sirable  sampling  rate  of  the  speech  signal. 

At  every  occurrence  of  a  speech  side  interrupt,  the  program 
control  is  transferred  to  program  counter  =  0,  which  contains  an 
instruction  to  jump  to  the  speech  side  interrupt  servicing  routine 
where  a  speech  sample  is  inputted  to  the  speech  buffer  from  the 
A/D  converter  and  then  a  processed  one  is  outputted  to  the  D/A 
converter.  The  program  utilizes  a  double  buffering  scheme  on 
both  the  input  and  output  buffers  which  allows  the  processing  of 
a  frame  of  data  simultaneously  with  I/O  accessing  of  the  other. 

The  speech  side  interrupt  also  sets  flags  and  pointers  indicating 
which  buffers  are  being  loaded  or  transmitted.  Figure  4-1 
describes  in  detail  the  operations  of  the  speech  side  interrupt. 

Similar  to  the  speech  side  interrupt,  the  transfer  of  binary 
data  from  one  PSP  to  the  other  is  controlled  by  the  line  side 
interrupt  whose  rate  is  set  by  a  second  real-time  counter 
(OOOEie).  At  every  occurrence  of  the  interrupt,  program  control 
is  sent  to  the  line  side  interrupt  servicing  routine  starting  at 
program  counter  =  1  where  transmission  and  reception  of  bit 
streams  are  performed.  For  the  line  side  transmitter,  the  PSP 
hardware  is  set  up  in  such  a  way  that  it  takes  two  interrupts  to 
generate  1  transmitter  clock  cycle  (a  high  clock  bit  followed  by 
a  low  one) .  At  the  same  time,  the  identical  data  bit  in  outputted 
throughout  the  entire  clock  cycle.  So,  in  order  to  achieve  a  par¬ 
ticular  data  rate,  the  line  side  has  to  be  interrupted  at  twice  the  dat 
rate.  For  example,  the  line  side  interrupt  rate  has  to  be  set  at  32 
KHz  for  16  Kb/s  transmission.  As  for  the  line  side  receiver,  the 
routine  inputs  an  8-bit  word  on  every  16th  interrupt.  Then  each 
of  these  bits  is  unpeeled  and  a  histogram  of  all  receiver  bits  is 
compiled  counting  the  number  of  times  a  1  is  received  in  each  bit 
position.  The  histogram  information  is  later  used  during  the  sync 
search.  A  flow  chart  description  of  the  line  side  interrupt  is 
depicted  in  Figure  4-2. 
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4.3  Initialization  and  Wait  Loop 

In  the  operation  of  the  real-time  program,  the  very  first 
function  is  the  initialization  routine  which  sets  most  of  the 
program  variables  prior  to  processing.  All  flags  and  storage 
buffers  are  cleared  together  with  loading  of  the  line  and  speech 
side  real-time  counters.  Band-pass  filter  coefficients  needed 
to  perform  decimation  and  interpolation  filtering  for  all  subbands 
are  loaded  into  the  X-buffer  of  the  high  speed  multiplier-accu¬ 
mulator  (MULACC) .  Figure  4-3  illustrates  the  storage  of  these 
coefficients.  Also,  quantizer/dequantizer  parameters  and  index 
registers  are  loaded  with  their  initial  values.  Upon  exiting 
from  the  initialization  routine,  the  speech  and  line  side  inter¬ 
rupts  are  enabled  and  program  control  is  then  transferred  to  the 
wait  loop. 

Figure  4-4  is  a  flow  chart  description  of  the  wait  loop. 

Since  not  all  of  the  available  processing  time  is  used  in  the 
real-time  subband  coder  programs,  the  wait  loop  consumes  all  the 
idle  time  left  over.  It  also  correctly  sequences  the  transmitting 
and  receiving  operations  by  examining  the  transmitter  and  receiver 
ready  flags  and  interrogates  the  front  panel  to  see  if  reinitiali¬ 
zation  is  desired. 

4.4  Transmitter/Receiver  Synchronization 

After  quantizing  the  subband  signals,  the  transmitter  data 
buffers  in  the  real-time  program  are  set  up  to  have  unused  bits. 

In  particular,  there  are  5  bits  left  over  in  the  9.6  Kb/s  and  4 
in  the  16.0  Kb/s  systems.  One  of  these  available  bits  is  allo¬ 
cated  in  the  programs  for  synchronization  purposes.  This  bit, 
generally  referred  to  as  the  SYNC  bit,  is  set  to  a  1  or  0  every 
alternate  frame.  Utilizing  a  search  procedure  for  this  SYNC 
bit,  the  receiver  is  able  to  locate  the  beginning  of  the  received 
buffer  for  proper  decoding  of  the  transmitted  data.  The  sync 
search  routine  is  performed  on  every  80th  pass  (or  675  msec) 
through  the  receiver  codes  in  the  16.0  Kb/s  system  and  on  every 
40th  pass  (or  750  msec)  in  the  9.6  Kb/s  system.  During  the 
remaining  times,  the  receiver  line  side  compiles  a  histogram 
which  tallies  the  times  that  a  1  has  been  received  in  each  bit 
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FIGURE  4-5  FLOW  CHART  OF  THE  SYNCHRONIZATION  ROUTINE 

4-9 


position  within  the  two  data  buffers.  The  sync  position  is  then 
located  by  computing  the  absolute  difference  between  the  histogram 
values  of  the  present  bit  position  and  that  of  the  bit  situated 
one  frame  apart.  The  bit  position  that  corresponds  to  the  largest 
difference  is  defined  as  the  sync  point.  To  assure  that  the 
correct  sync  point  has  indeed  been  found,  sync  search  is  repeated 
two  more  times.  If  the  sync  point  is  identical  in  all  3  cases, 
the  true  one  is  located.  Otherwise,  the  sync  search  is  resumed 
until  three  identical  sync  points  are  consecutively  found.  After 
achieving  synchronization,  the  sync  search  is 

still  performed.  A  sync  drop  counter  is  utilized  to  keep  track 
of  the  failure  of  synchronization.  As  a  matter  of  fact,  if  the 
two  PSP’s  are  out  of  sync  five  times,  resynchronization  is  auto¬ 
matically  repeated.  Figure  4-5  is  a  flowchart  description  of 
the  synchronization  routine. 

4,5  Decimation  Filtering 

Once  the  initialization  and  synchronization  procedures  have 
been  completed,  the  PSP's  are  ready  to  process  the  input  speech. 

As  described  before,  the  transmitter  ready  flag  (FLAG)  is  con¬ 
stantly  examined  in  the  wait  loop  to  determine  when  a  frame  of 
new  speech  data  has  been  loaded.  When  FLAG  is  set,  program  con¬ 
trol  is  transferred  to  the  transmitter  routine  whose  major  func¬ 
tions  include  decimation  filtering. 

Upon  entering  the  transmitter  code,  a  sync  pulse  is  outputted 
for  timing  purposes  and  FLAG  is  reset.  Then  the  filter  nemorios 
arc  updated  together  \7ith  the  loading  of  a  new  frame  of  speech  data. 

The  decimation  filters  decompose  the  incoming  speech  signal 
into  four  (in  the  9.6  Kb/s  system)  or  five  (in  the  16.0  Kb/s 
system)  subbands.  The  integer  band  down-sampling  method  is  used 
to  perform  the  low-pass  translation  of  each  subband.  The  speech 
samples  stored  in  the  decimation  filter  buffers  are  loaded 
into  the  Y-array  of  MULACC  as  shown  in  Figure  4-6.  It  is  only 
necessary  to  load  this  buffer  once  every  frame. 

After  the  X-  and  Y-buffers  of  MULACC  are  loaded,  the  following 
filtering  operation  is  implemented  for  each  subband: 
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filter 

memories 


Illustrate  this,  the  multip] ication  with  rounding  and  accumulation  of  two  arrays 
of  0‘s  shows  a  non-zero  final  value. 
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FIGURE  4-6  LOADING  OF  THE  Y-BUFPER  FOR  DECIMATION  FILTERING 
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y(nL)  =  I  h(k)x{nL-k)  (4-1) 

k=0 

where  L  is  the  decimation  ratio,  h(k)  is  the  kth  filter  coeffi- 
ci  jnt,  x(')  is  tne  input  sample,  and  y(')  is  the  filtered  output 

Initially,  MULACC  is  setup  to  perform  multiply  and  accumulate 
with  no  rounding;  increment  X-buffer  and  decrement  Y-buffer 
pointers.  The  X-buffer  pointer  is  directed  to  the  first  filter 
coefficient,  h(0),  of  band  1  and  the  Y-buffer  one  is  set  to 
point  at  x(0)  which  is  stored  at  the  199th  location  of  the  Y-buffer. 
By  setting  the  loop  counter  to  be  equal  to  200,  the  filtering 
operation  is  started.  After  MULACC  finishes,  the  ready  bit 
in  the  extended  product  (XTP)  is  set  and  the  result  is  trans¬ 
ferred  to  PSP.  Incrementing  the  Y-pointer  by  a  factor  of  L  and 
resetting  the  X-pointer  to  0*  computation  of  the  second  sample  is 
performed.  Since  it  takes  MULACC  42  msec  to  compute  each  output 
point,  the  PSP  is  free  to  perform  other  functions.  In  the 

real-time  programs,  this  time  is  used  for  the  quantization  of  the 
previously  computed  sample,  thereby  hiding  the  amount  of  time 
needed  for  quantization. 

When  the  point  y(nL)  is  computed,  the  X  and  Y  buffer  pointers 
are  reset  to  compute  the  point  y((n+l)L).  If  all  the  points  for 
the  particular  band  have  been  calculated,  program  control  is 
transferred  to  the  filter  loop  of  the  next  subband.  Once  the 
filtering  for  all  bands  is  complete,  the  program  leaves  for  the 
encoder  routine.  Figure  4-7  is  a  flow  chart  that  describes  the 
operation  of  the  decimation  filters  for  the  9.G  Kb/s  system.  Tlie 
corresponding  codes  for  the  lo.O  Kb/s  system  are  similar. 
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FIGURE  4-7  FLOW  CHART  OF  DECi;iATION  FILTERING 


4.6  Quantization  and  Encoding 

After  the  band-pass  filtering  and  decimation  operations,  the 
resulting  subband  signals  are  converted  to  binary  bit  streams  using 
adaptive  PCM  (APCM)  quantizers.  Employing  the  adaptation  strategy 
f  one-word  memory,  the  quantizer  dynamically  adiusts  its  step 
sizes  according  to  the  amplitude  of  the  input  signal.  Maxi¬ 
mum  and  minimum  step  sizes,  adaptation  and  gain  factors  for  the 
quantizers  are  summarized  in  Table  2-3.  As  described  earlier,  the 
quantization  is  done  within  the  subband  filtering  loop  in  the  pro¬ 
gram  to  minimize  processing  time. 

Since  the  adaptive  quantizer  is  known  to  perform  poorly  in 
the  presence  of  channel  errors,  the  robust  algorithm  as  described 
in  Appendix  D  is  incorporated.  In  the  actual  implementation, 
the  decaying  factor  B  is  chosen  to  be  0.96  and  the  function  , 
where  A  is  the  quantizer  step-size,  is  estimated  using  a  poly¬ 
nomial  approximation  procedure  as  described  in  Appendix  E  . 

To  further  reduce  processing  time,  only  a  3rd  order  polynomial 
is  utilized.  The  flow-chart  of  an  8-level,  robust  quantizer 
is  shown  in  Figure  4-0. 

After  quantizing  the  subband  signals,  the  resulting  bit  stream 
is  encoded  and  stored  in  the  appropriate  output  buffer  as  depicted 
in  Figure  4-9.  The  formats  of  the  transmission  data  for  both 

9.6  and  16.0  Kb/s  are  shown  in  Figures  4-10(a)  and  (b) .  Upon  fin¬ 
ishing  the  encoding  process,  all  transmitter  functions  have  been  com¬ 
pleted.  Then  a  sync  pulse  is  outputted  to  denote  the  end  of  all  trans¬ 
mitter  functions  and  program  control  is  transferred  back  to  the  wait  loop. 

4.7  Decoding  and  Dequantization 

When  the  receiver  ready  flag  (LNSW)  is  set  equal  to  1,  the 
pto«jram  will  leave  the  wait  loop  and  go  to  the 
r-'civer  code.  Decoding  of  the  received  data  as  shown  in  Figure 
I  !  IS  t  h»*  first  function  performed.  Through  this  procedure,  the 
.  I  V:  •  reconstructs  code  words  from  the  data  buffer  each  of  which 
c  o  s  to  a  transmitted  sample. 

c  f  *  h«*  decoding,  dequantizer  routines  are  employed  to 
0  -rived  code  words  to  their  corresponding  values.  A 
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flow  chart  description  of  an  8-level  robust  dequantizer  is  depicted 
in  Figure  4-13.  Whenever  the  dequantizer  step  size  A  falls  below 
a  pre-determined  threshold  value  Ath,  a  mid-rise/mid- tread  switch, 
as  described  in  Section  2,  is  used  to  eliminate  the  tones  gener¬ 
ated  during  idle  channel  conditions. 

4.8  Interpolation  Filtering 

After  dequantizing  the  received  samples,  they  are  stored  in 
the  approporate  filter  buffers.  Since  the  received  sample 
sequence  contains  information  on  the  original  signal  only  at 
intervals  of  L  samples,  where  L  is  the  decimation  (interpolation) 
ratio,  the  remaining  L-1  samples  are  generated  by  interpolation 
filtering  at  the  receiver  as  shown  in  Figure  4-13. 

The  real-time  interpolation  filter  code  implements  the 
following: 


y  (n) 


199 

E 

k=0 


h(k)x|2^j 


(4-2) 


whore  fcj  is  defined  as: 


if  C  is  an  integer 
otherwise 


(4-3) 


As  discussed  in  Section  2,  it  is  not  necessary  to  perform 
200  multiplies  to  obtain  each  output  sample  owing  to  the  fact  that 
a  lot  of  the  input  points  x(*)  are  zero.  For  the  sake  of  clarity, 
the  following  interpolations  of  Band  1  signal  in  the  9.6  Kb/s 
system  are  examined  in  detail  (L=20) : 


y(0)  =  h(0)  X  (0)  +  h(20)  x  (-1)  +  ...  +  h(180)  x  (-9) 
y(l)  =  h(l)  X  (0)  +  h(21)  x  (-1)  +  ...  +  h(181)  x  (-9) 

y(19)  =  h(19)  X  (0)  +  h(39)  X  (-1)  +  ...  +  h(199)  x  (-9) 
y(20)  =  h(0)  X  (1)  +  h(20)  x  (0)  +  ...  +  h(180)  x  (-8) 
y(21)  =  h(l)  X  (1)  +  h(21)  X  (0)  +  ...  +  h(181)  x  (-8) 
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y(39)  =  h(19)  X  (1)  +  h(39)  x  (0)  +  ...  +  h(199)  x  (-8) 


y(179)  =  h(19)  X  (8)  +  h(39)  x  (7)  +  ...  +  h(199)  x  (-1) 

It  is  obvious  from  the  above  equations  that  only  200/20  multipli¬ 
cations  are  needed  to  calculate  each  y(n) .  Furthermore,  by 
arranging  the  filter  coefficients  and  input  seunples  in  a  special 
fashion,  as  shown  in  Figure  4-11,  the  output  values  can  be 
efficiently  computed  using  the  MULACC. 

Initially,  the  Y-buffer  pointer  is  set  to  X(0)  and  the 
X-buffer  pointer  is  set  to  hl0) ,  The  multiplier  is  set  up  to 
perform  ten  multiply-and-accumulates  with  both  the  X  and  Y 
pointers  incrementing  after  each  operation.  When  MULACC 
finishes,  y(0)  is  computed.  Then  with  the  X-buffer  pointer 
directing  at  h(l)  and  resetting  the  Y-buffer  pointer  to  x(0) , 

MULACC  is  restarted  which  later  yields  yd)  . 

The  process  is  iterated  for  L-2  times  and  y(2) ,  y(3)  ... 
y(19)  are  computed.  Then  the  X-buffer  pointer  is  reset  to  h(0) 
and  the  Y-buffer  pointer  is  set  to  x(l)  .  By  repeating  the  above 
procedure,  all  interpolator  outputs  are  computed.  Tallying  up  the 
multiplies  involved,  it  is  noted  that  the  total  number  is  roughly 
comparable  in  both  interpolation  and  decimation  filtering.  However, 
a  closer  excimination  of  the  real-time  implementation  reveals  that 
that  symmetry  of  the  FIR  filters  has  not  been  exploited.  This  can 
be  explained  by  the  fact  that  progreunming  of  MULACC  is  more  effi¬ 
cient  when  the  filter  coefficients  are  strung  out.  Furthermore, 
the  speed  of  MULACC  is  capable  of  performing  full-duplex  9.6  Kb/s 
subband  coder  and  the  more  complicated  16.0  Kb/s  system  in  a  half¬ 
duplex  mode. 

When  interpolation  filterings  for  all  subbands  are  finished, 
the  resulting  points  are  added  together  before  being  loaded  into 
the  output  buffer.  Then  a  sync  pulse  is  generated  to  signify  the 
end  of  the  receiver  function  and  the  program  returns  to  the  wait 
loop. 
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FIGURE  4-14  STORAGE  OF  X  AND  Y-BUFFERS  FOR  INTERPOLATION  FILTERING 
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4.9  Operating  Procedures  of  the  Real-Time  Subband  Coder  Programs 

4.9.1  9.6  K^'/s  oror*ra"' 

I .  LOADING 

Load  the  program  SBC96.PBJ  into  both  PSP's  using  the  PSP 
Loader  Program  PSPLD. 

II.  OPERATING  PROCEDURES 

After  the  program  has  been  loaded,  depress  bit  15  of  PSP 
front  panel  switches.  Then  start  the  subband  coder  program  by 
setting  the  RUN/STOP  switch  to  RUN.  In  this  mode,  the  program 
will  loop  around  an  initialization  routine  which  clears  out 
data  buffers  and  resets  loop  registers.  When  bit  15  is  placed 
in  the  up  position,  the  software  will  first  enable  synchroniza¬ 
tion  between  the  two  PSP's  and  then  proceed  to  process  speech 
using  the  subband  coder  algorithm.  If  re-synchronization  is 
desired,  hold  bit  15  of  both  PSP's  in  the  down  position  and 
return  it  one  at  a  time  to  the  up  position.  Options  of  the  9.6 
Kb/s  subband  coder  program  are  summarized  as; 

bit  15  DOWN;  INITIALIZATION 

UP ;  NORMAL  OPERATION 

bit  12  DOWN;  ROBUST  QUANTIZER 

UP;  ADAPTIVE  JAYANT  QUANTIZER 
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4.9.2  16.0  Kb/s  Program 


I .  LOADING 

Load  the  program  SBC16.PBJ  into  both  PSP's  using  the  PSP 
Lioader  Proqrcun  PSPLD. 


II.  OPERATING  PROCEDURES 


After  the  program  has  been  loaded,  depress  bit  15  of  PSP 
front  panel  switches.  Then  start  the  subband  coder  program  by 
setting  the  RUN/STOP  switch  to  RUN.  In  this  mode,  the  program 
will  loop  around  an  initialization  routine  which  clears  out 
data  buffers  and  resets  loop  registers.  When  bit  15  is  placed 
in  the  up  position,  the  software  will  first  enable  synchronlza-- 
tion  between  the  two  PSP's  and  then  proceed  to  process  speech 
using  the  subband  coder  algorithm.  If  re-synchronization  is 
desired,  hold  bit  15  of  both  PSP's  in  the  down  position  and 
return  it  to  the  up  position.  Since  the  16  Kb/s  subband  coder 
program  only  works  in  the  half-duplex  mode,  the  push-to-talk 
switch  on  the  handset  has  to  be  depressed  to  allow 
speech  transmission.  Options  of  the  16.0  Kb/s  subband  coder 
program  are  summarized  as; 


bit  15 


DOWN:  INITIALIZATION 

UP:  NORMAL  OPERATION 


bit  14 


DOWN;  TRANSMITTER 
UP:  PUSH-TO-TALK 


bit  13 


DOWN;  RECEIVER 

UP;  PUSH-TO-TALK 


bit  12 


DOWN:  ROBUST  QUANTIZER 

UP:  ADAPTIVE  JAYANT  QUANTIZER 
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SECTION  V 

Conclusions  and  Recommendations 


5.1  Conclusions 

This  contract  has  resulted  in  the  development  of  ft  9.6  and  a 
i.6 . 3  Kb/s  real-time  subband  coder  which  employ  the  integer-band 
sampling  technique.  Informal  listening  tests  indicate  that  the 
speech  quality  of  the  subband  coders  is  slightly  "hollow"  and 
noisy  especially  for  the  9.6  Kb/s  system,  but  its  overall  quality 
is  significantly  better  than  that  of  CVSD  at  the  same  data  rate. 
However,  when  compared  to  other  high  quality  16  Kb/s  speech  en¬ 
coding  schemes,  such  as  the  Adaptive  Predictive  Coder  with 
Adaptive  Quantization  (APCQ) ,  the  subband  coder  is  clearly 
inferior.  GTE  Sylvania  attributes  this  to  the  fact  that  the  four 
or  five  bandpass  filters  employed  by  the  subband  coder  algorithm 
are  not  adequate  to  represent  the  entire  speech  spectrum.  This 
introduces  spectral  "gaps"  which  manifest  the  "hollowness"  in  the 
processed  speech.  Moreover,  the  ratio  of  subband  energies 
fluctuates  largely  from  freune  to  frame  and  quantization  of  them 
using  adaptive  PCM  (APCM)  with  fixed  bit  allocations  results  in 
noisier  output  speech. 

In  light  of  the  inadequacy  of  the  integer-band  sampling 
ADoroach,  GTE  Svlvmia  developed  an  improved  subband  coding  scheme 
using  quadrature  mirror  filters  (QMF) .  An  adaptive  bit  allocation 
scheme  is  also  incorporated  to  quantize  the  subband  signals. 

FORTRAN  simulations  show  that  no  significant  distortions  are  intro¬ 
duced  in  the  bandsplitting/reconstruction  process  using  QMF  filters 
and  this  results  in  "smoother"  quality  processed  speech.  Further¬ 
more,  the  adaptative  bit  allocation  scheme  enables  a  more  effi¬ 
cient  encoding  of  the  subband  waveforms  which  eliminates  a  lot  of 
quantization  noise  in  the  outputs  speech. 

5.2  Recommendations 

Encouraged  by  the  latest  results,  GTE  Sylvania  strongly 
recommends  the  subband  coding  algorithm  with  the  QMF  approach  be 
studied  further  and  be  implemented  especially  at  16  Kb/s.  More- 
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over,  since  present  subband  coders  are  only  designed  to  capt'-re 
the  spectral  shape  of  the  speech  signal,  GTE  Sylvania  believes 
that  the  speech  quality  of  the  QMF  method  can  be  further  improved 
by  exploiting  the  fine  structures  of  the  speech  spectrum.  This 
is  also  reinforced  by  the  findings  of  a  recent  study  which  incor¬ 
porates  pitch  information  in  performing  subband  coding  of  speech 
signals  ri4]. 
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mPPLnjJIX  c 


TABULATION  OF  THE  60-TAP  QMF  IMPULSE  RESPONSE 


HI  1)— O.eO4560OlE-lO 
2U-0.r2466006E-0e 

H(  3)-  0. 3393791 2E-08 
H(  4)*  0.125S5994E-07 
H(  5)— 0.S48X62B4E-07 
H(  6)— 0.797731 59E-07 
H(  7)-  0.S6410033E-0& 
H(  8)^  0.19332403E-04 
H(  9)— 0.40112018E-05 
H<  10)»  0.138S9838E-05 
H(  11 )>  0.20943698E-04 
H(  12)— 0.18127259E-04 
H(  13)»-0.83554602E-04 
H(  14>a  0.11246781E-03 
H(  15 0.26012788E-03 
H(  16)»-0.49234246E-03 
H(  17)=-0.63178962E-03 
H(  18)-=  0.1 688084  lE-02 
H(  19)=  0.11479852E-02 
H(  20)=-0.47692582E-02 
H(  21>=-0.12668951E-02 
H(  22)=  0.11493831E-01 
H(  23)=-0.66263479E-03 
H(  24)=-0.24424994E-01 
H(  25)=  0.84577305E-02 
H(  26)=  0.48056692E-01 
H<  27)=-0.31792097E-01 
H(  28)=-0.98794423E-01 
H(  29)=  0.12183660E-t-00 
H(  30)=  0.46986365E+00 
H<  31)=  0.4698636SEi00 
H<  32)=  0.12183660E-f00 
H(  33)=-0.98794423E-01 
H(  34)=-0.31792097E-01 
H(  35)=  0.48056692E-01 
H(  36)=  0.84577305E-02 
H<  37)=-0.24424994E-01 
H(  38)=-0.66263479E-03 
H(  39)=  0.11493831E-01 
H(  40)=-0.12668951E-02 
H(  41)*-0.47692582E-02 
H<  42)=  0.11479852E-02 
H(  43)=  0.16880841E-02 
H<  44)=-0.6317e962E-03 
H(  45)=-0.49234246E-03 
H(  46)=  0.2601278eE-03 
H<  47)=  0.11246781E-03 
H<  48)=-0.83554602E-04 
H(  49)»-0.18127259E-04 
H<  50)=  0.20943698E-04 
H<  51)=  0.1385983eE-05 
H<  52)— 0.401120ieE-05 
H<  53)=  0.19332403E-06 
H(  54)-  0.56410033E-06 
H<  55)— 0.79773 159E-07 
H<  56)— 0.548662e4E-07 
M<  57)-  0.125SS994E-07 
H(  58)=  0.33937912E-08 
H(  59)— 0.12466006E-08 
H(  60)— 0..80456801E-10 
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App>endix  0 
The  Robust  Quantizer 


The  subband  coder  employs  adaptive  PCM  (APCM)  quantizers  with 
the  Jayant  algorithm  which  performs  well  in  a  back-to-back  mode 
without  channel  errors.  Unfortunately,  channel  errors  signifi¬ 
cantly  impair  performance.  To  see  this,  consider 

"  ”‘'k>  'Jk-i  •  "'h'  % 

where  is  the  ith  transmitted  character  and  M(I^)  is  the  multi¬ 
plier  corresponding  to  I^. 

Assume  now  that  one  transmission  error  is  made  at  time 
causing 

to  become 

and 

to  become 

afterwards, 

the  new  quantizer  levels  for  time  greater  than  t  become 


»iii' 


M(I,) 


or,  simplifying,  we  obtain 

**k'  “  ‘^k  M(Ijj) 


(D-2) 


(D-3) 


Consequently,  the  effects  of  even  a  single-channel  error  never 
die  out.  In  fact,  an  error  causes  a  level  shift  in  qj^  by  the  factor 
«(In')/M(Ij) .  Simulations  show  that  errors  cause  noticeable  fades 
and  increases  in  output  level.  Only  when  the  quantizer  saturates 


D-1 


at  its  maximum  level  or  cuts  off  at  its  minimum  level  does  the 
quantizer  again  track  properly. 


By  incorporating  a  small  decaying  factor  in  the  Jayant  algo¬ 
rithm,  the  resulting  quantizer  can  be  made  more  robust  to  channel 
errors.  This  modification  calls  for  the  raising  of  the  last 
quantizer  step  size,  qv-i'  *  power  S  as  follows 


where  ti  ( • )  represents  the  product  symbol,  is  the  ith  trans 
mitted  symbol,  and  M{I^)  is  the  multiplier  corresponding  to  I 
Again,  if  one  channel  error  occurs  at  time  i 


or  simplifying 


Assuming  6  <  1,  then  for  times  k  far  removed  from  K.  (k  >>  £) 


The  speed  of  this  convergence  depends  on  how  near  6  is  to  1.  If 
8  is  not  too  close  to  unity,  convergence  is  rapid.  Consequently 
the  effect  of  a  single  error  decays  with  time. 


APPENDIX  E 

EVALUATING  TRANSCENDENTAL  FUNCTIONS 
WITHOUT  DIVISION 


A.  Arcese  and  A.  J.  Goldberg 
ABSTRACT 

We  consider  least  square-error  polynomial  approximations  to  trancendental 
ftinctions  over  a  finite  interval.  These  approximations  are  useful  when  computer 
division  times  greatly  exceed  multiplication  times.  Examples  are  given  for  1/x, 
>/5r,  e"*,  and  In  x. 


The  authors  are  with  GTE  Sylvania,  Electronics  Systems  Group,  Needham 
Heights,  Massachusetts,  02194. 
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(5) 


where  i  *  1, 2,  . . . ,  a  is  the  solution  column  vector 

^  •••’%] 

Table  I  gives  the  coefficients  for  a  third  degree  approximation  to  1/x,  vGT, 
e“*,  and  In  x  over  the  interval  (0.5, 1.0).  For  1/x  andlnx  it  was  necessary  to 
subdivide  the  interval  to  lower  the  maximum  error.  Figures  1  through  5  give  the 
percent  error 


percent  error 


X  100 


(7) 


for  each  of  the  functions. 


TABLE  I 

TABLE  OF  LEAST  SQUARE  COEFFICIENTS 

^  2  3 

fW  -  +  a^x  +  ag*  +  agX 


f(jc) 

»2 

»3 

Range  of  x 

1/x 

4.895 

-7.918 

4.233 

(0.5,0.75) 

2.626 

-1.081 

-1.816 

1.273 

(0.75, 1.0) 

vT 

0.2613 

1.116 

-0.5193 

0.1423 

(0.5, 1.0) 

e-* 

1.060 

-1.238 

0.7905 

(0.5, 1.0) 

In  X 

-2.222 

4.392 

-3.167 

(0.5,0.95) 

-1.007 

0.4711 

1.071 

-0.5358 

(0.95,1.0) 

%  ERROR  VS  X 


FIGURE  1 


MAXIMUM  ERROR  -  >  0.011% 


%  ERROR  VS  X 


E-8 


3.00 


